Accelerating the Development of Hardware Accelerators
نویسندگان
چکیده
ASC [1], A Stream Compiler, is designed to enable rapid development of hardware accelerators while still producing results that match hand-crafted equivalents. An ASC program represents a dataflow system which can be seen as a stream; to avoid the difficulties often associated with behavioural synthesis, ASC allows direct implementation of a hardware design based on the programmer’s input. Stream architectures are constructed using a C++ based object oriented approach, and as shown in Figure 1, a single ASC description can target Field Programmable Gate Arrays (FPGAs) and also other accelerator platforms, such as Graphics Processing Units (GPUs) and Sony Playstation 2 vector units. Architectural differences are transparent to the programmer in a basic implementation, requiring only the selection of an appropriate target architecture. FPGAs show excellent potential as hardware accelerators for a wide class of applications. While running many times slower than modern Pentium-type processors, the massive fine-grained parallelism and many independent memory buses of FPGAs can permit order-of-magnitude speed-ups over software. However, programming FPGAs remains essentially a hardware design task and is much harder than writing software. High level synthesis tools attempt to bridge the gap between algorithm and implementation, however the circuits that are obtained usually compare poorly with handcrafted implementations. While intended for graphics acceleration, modern GPUs are highly programmable parallel stream computation engines [4] and increasingly suitable for general computation [3]. The Sony Playstation 2 processor contains two programmable vector units that can execute independently, the ability of this system to accelerate applications is thus indicative of the potential of upcoming similar architectures such as Cell. Through targeting multiple different acceleration architectures from a common description, we can compare the performance of each architecture for a particular algorithm. Once the best accelerator architecture has been selected, architecture specific extensions can be used to further optimise the implementation. Developer access to multiple levels in the design hierarchy offers the ASC programmer great flexibility of implementation when required, permitting optimisation at the alGPU PS2 ASC GPU ASC ASC PS2
منابع مشابه
Introduction to Plasma Accelerators : the Basics
In this article, we concentrate on the basic physics of relativistic plasma wave accelerators. The generation of relativistic plasma waves by intense lasers or electron beams in low-density plasmas is important in the quest for producing ultra-high acceleration gradients for accelerators. A number of methods are being pursued vigorously to achieve ultra-high acceleration gradients using various...
متن کاملطراحی و شبیهسازی شتابدهنده داینامیترون 800 کیلو ولتی با نرم افزار CST
Nowadays, middle energy electrostatic accelerators in industries are widely used due to their high efficiency and low cost compared with other types of accelerators. In this paper, the importance and applications of electrostatic accelerators with 800 keV energy are studied. Design and simulation of capacitive coupling of a dynamitron accelerator is proposed. Furthermore, accelerating tube are ...
متن کاملHardware Acceleration for Neuromorphic Vision Algorithms
Neuromorphic vision algorithms are biologically inspired models that follow the processing that takes place in the primate visual cortex. Despite their efficiency and robustness, the complexity of these algorithms results in reduced performance when executed on general purpose processors. This paper proposes an application-specific system for accelerating a neuromorphic vision system for object...
متن کاملConfigurable and scalable class of high performance hardware accelerators for simultaneous DNA sequence alignment
A new class of efficient and flexible hardware accelerators for DNA local sequence alignment based on the widely used Smith–Waterman algorithm is proposed in this paper. This new class of accelerating structures exploits an innovative technique that tracks the origin coordinates of the best alignment to allow a significant reduction of the size of the dynamic programming matrix that needs to be...
متن کاملIMORC: An infrastructure and architecture template for implementing high-performance reconfigurable FPGA accelerators
The design, implementation and optimization of FPGA accelerators is a challenging task, especially when the accelerator comprises multiple compute cores distributed across CPU and FPGA resources and memories and exhibits data-dependent runtime behavior. In order to simplify the development of FPGA accelerators we propose IMORC, an infrastructure and architecture template that helps raising the ...
متن کاملLarge Fields for Smaller Facility Sources
Compared to conventional particle accelerators, plasmas can sustain accelerating fields that are thousands of times higher. To exploit this ability, massively parallel SciDAC particle simulations provide physical insight into the development of next-generation accelerators that use laser-driven plasma waves. These plasmabased accelerators offer a path to more compact, ultra-fast particle and ra...
متن کامل